FALCON: Federated Active Linguistic data CuratiON
نویسنده
چکیده
FALCON assembles a state of the art online translation tool chain that combines web site translation, translation management, computer aided translation and terminology management products. This tool chain tool chain has been enhanced with open-source automatic term extraction and machine translation technology. Iterative quality improvement in this language technology is delivered by using linked data to actively manage the curation and reuse of language resources within customer projects. The work demonstrates the integration the management of iterative SMT training with active curation of MT corrections and target term capture by post-editors in a live localisation workflow using this commercial tool chain. Key capabilities offered are:
منابع مشابه
Study of the foundation, models and issues of research data curation and management in scientific and academic environments
Background and Aim: The purpose of this paper is to study, identifying and discuss the foundation and concepts, models and frameworks, dimensions and challenges of research data curation and management in scientific and academic environments. Method: This article is a review article and library method was used to collect scientific and research texts in this field. In this research, external an...
متن کاملTraining in Data Curation as Service in a Federated Data Infrastructure - The FrontOffice-BackOffice Model
The increasing volume and importance of research data leads to the emergence of research data infrastructures in which data management plays an important role. As a consequence, practices at digital archives and libraries change. In this paper, we focus on a possible alliance between archives and libraries around training activities in data curation. We introduce a so-called FrontOffice–BackOff...
متن کاملFalconAO: Aligning Ontologies with Falcon
Falcon-AO is an automatic tool for aligning ontologies. There are two matchers integrated in Falcon-AO: one is a matcher based on linguistic matching for ontologies, called LMO; the other is a matcher based on graph matching for ontologies, called GMO. In Falcon-AO, GMO takes the alignments generated by LMO as external input and outputs additional alignments. Reliable alignments are gained thro...
متن کاملThe ecology of documentary and descriptive linguistics1
The primary goal of developing this model will be to facilitate the characterization of tools and standards for digital linguistic resources with respect to the entire documentary and descriptive process in order to (i) help researchers avoid duplicating the work of others unnecessarily and (ii) ensure that linguistics, as a discipline, does not accidentally focus on particularly salient domain...
متن کاملGlobal Intelligent Content: Active Curation of Language Resources using Linked Data
As language resources start to become available in linked data formats, it becomes relevant to consider how linked data interoperability can play a role in active language processing workflows as well as for more static language resource publishing. This paper proposes that linked data may have a valuable role to play in tracking the use and generation of language resources in such workflows in...
متن کامل